Search CORE

4 research outputs found

Recommended from our members

Sentiment analysis: text, pre-processing, reader views and cross domains

Author: Haddi Emma
Publication venue: Brunel University London
Publication date: 01/01/2015
Field of study

This thesis was submitted for the award of Doctor of Philosophy and was awarded by Brunel University LondonSentiment analysis has emerged as a field that has attracted a significant amount of attention since it has a wide variety of applications that could benefit from its results, such as news analytics, marketing, question answering, knowledge management and so on. This area, however, is still early in its development where urgent improvements are required on many issues, particularly on the performance of sentiment classification. In this thesis, three key challenging issues affecting sentiment classification are outlined and innovative ways of addressing these issues are presented. First, text pre-processing has been found crucial on the sentiment classification performance. Consequently, a combination of several existing preprocessing methods is proposed for the sentiment classification process. Second, text properties of financial news are utilised to build models to predict sentiment. Two different models are proposed, one that uses financial events to predict financial news sentiment, and the other uses a new interesting perspective that considers the opinion reader view, as opposed to the classic approach that examines the opinion holder view. A new method to capture the reader sentiment is suggested. Third, one characteristic of financial news is that it stretches over a number of domains, and it is very challenging to infer sentiment between different domains. Various approaches for cross-domain sentiment analysis have been proposed and critically evaluated

Brunel University Research Archive

EXACT2: the semantics of biomedical protocols

Author: A Maccagnan
A Pease
A Sackmann
A Sujathaa
Brian B Rudkin
CJ Mungall
Daniel Nadis
Doi
Emma Haddi
Grunwald
H Obokata
I Mura
J Taubert
K Wolstencroft
Larisa N Soldatova
LN Soldatova
LN Soldatova
LN Soldatova
M Courtot
M Hilario
M Schilling
Nigel J Saunders
Piyali S Basu
R Garside
RD King
Ross D King
RR Brinkman
S Mitchell
S Rune
S Shapin
T Bittner
T Klingström
Th Paul
V Rätzel
Véronique Baumlé
W Ceusters
Wolfgang Marwan
Z Xiang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

© 2014 Soldatova et al.; licensee BioMed Central. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.This article has been made available through the Brunel Open Access Publishing Fund.Background: The reliability and reproducibility of experimental procedures is a cornerstone of scientific practice. There is a pressing technological need for the better representation of biomedical protocols to enable other agents (human or machine) to better reproduce results. A framework that ensures that all information required for the replication of experimental protocols is essential to achieve reproducibility. Methods: We have developed the ontology EXACT2 (EXperimental ACTions) that is designed to capture the full semantics of biomedical protocols required for their reproducibility. To construct EXACT2 we manually inspected hundreds of published and commercial biomedical protocols from several areas of biomedicine. After establishing a clear pattern for extracting the required information we utilized text-mining tools to translate the protocols into a machine amenable format. We have verified the utility of EXACT2 through the successful processing of previously ‘unseen’ (not used for the construction of EXACT2) protocols. Results: The paper reports on a fundamentally new version EXACT2 that supports the semantically-defined representation of biomedical protocols. The ability of EXACT2 to capture the semantics of biomedical procedures was verified through a text mining use case. In this EXACT2 is used as a reference model for text mining tools to identify terms pertinent to experimental actions, and their properties, in biomedical protocols expressed in natural language. An EXACT2-based framework for the translation of biomedical protocols to a machine amenable format is proposed. Conclusions: The EXACT2 ontology is sufficient to record, in a machine processable form, the essential information about biomedical protocols. EXACT2 defines explicit semantics of experimental actions, and can be used by various computer applications. It can serve as a reference model for for the translation of biomedical protocols in natural language into a semantically-defined format.This work has been partially funded by the Brunel University BRIEF award and a grant from Occams Resources

Goldsmiths Research Online

Crossref

Springer - Publisher Connector

PubMed Central

Brunel University Research Archive

Pre-processing Framework for Twitter Sentiment Classification

Author: A Kanavos
A Kanavos
B Liu
B Pang
DM Blei
DM Blei
Emma Haddi
I Kavakiotis
J Zhao
KP Murphy
Lin Zhang
Q Ye
S García
TL Griffiths
Publication venue: Springer International Publishing
Publication date
Field of study

Part 2: 8th Mining Humanistic Data WorkshopInternational audienceTwitter Sentiment Classification is undergoing great appeal from the research community; also, user posts and opinions are producing very interesting conclusions and information. In the context of this paper, a pre-processing tool was developed in Python language. This tool processes text and natural language data intending to remove wrong values and noise. The main reason for developing such a tool is to achieve sentiment analysis in an optimum and efficient way. The most remarkable characteristic is considered the use of emojis and emoticons in the sentiment analysis field. Moreover, supervised machine learning techniques were utilized for the analysis of users’ posts. Through our experiments, the performance of the involved classifiers, namely Naive Bayes and SVM, under specific parameters such as the size of the training data, the employed methods for feature selection (unigrams, bigrams and trigrams) are evaluated. Finally, the performance was assessed based on independent datasets through the application of k-fold cross validation

Crossref

Explaining the Development of International Norms: The Humanitarian Turn at the United Nations Security Council

Author: Alastair Johnston
Andrew P Cortell
Arthur Spirling
Barry O&apos
Bear F Braumoeller
Bettina Gr�n
Boyi Xie
Brandon M Stewart
C True-Frost
Carol Cohn
Chaim D Kaufmann
Cheryl Shanks
Daniel C Thomas
Daniel Diermeier
David L Bosco
David M Blei
David S Law
David Schweigman
Deerwester Scott
Emma Haddi
Eric A Posner
G Ikenberry
G Ikenberry
Gary J Bass
Gunter Denhert
Harald M�ller
Hun Kim
Ian Hurd
Ian Johnstone
James G March
James Q Wilson
Jeffrey Lewis
Jeffrey T Checkel
Jeffrey T Checkel
Jeffrey W Legro
John Mueller
Jonathan B Slapin
Jonathan Chang
Juan Cao
Julie A Mertus
Jutta Joachim
Kamal Nigam
Kenneth N Waltz
Kevin Simler
Laura J Shepherd
Marc Trachtenberg
Mark H Hansen
Martin Ponweiser
Michael A Bailey
Mona Krook
Oona Hathaway
Oona Hathaway
Oren Liebermann
Rajkumar Arun
Rebecca Adler-Nissen
Richard Hanania
Romain Deveaud
Ruti G Teitel
Samuel Moyn
Steven Pinker
Steven Pinker
Susan Park
Tao Qin
Ted Hopf
Thiago S Guzella
Thomas Risse
Torunn L Tryggestad
Wesley W Widmaier
Yee W Teh
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

Crossref